Experiments in Parameter Learning Using Temporal Differences
Abstract
In this paper we discuss the problem of automatically learning evaluation function parameters in a chess program. In particular, we describe some experiments in which our chess program KnightCap learnt the parameters of its evaluation function using a combination of Temporal Difference learning and on-line play on FICS and ICC. KnightCap is freely available on the web from http://wwwsyseng.anu.edu.au/lsg. The main success we report is that KnightCap went from a (blitz) rating of 1650 to a rating of 2150 in just 3 days and 308 games. We discuss the details of our learning algorithm, details of KnightCap, and some of the principal reasons for KnightCap’s rapid improvement.
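To make the idea of learning evaluation parameters from temporal differences concrete, here is a minimal sketch of a generic TD(λ) weight update applied to the positions of one finished game. It is not KnightCap's actual code, and the abstract does not give the details of the learning algorithm. The linear evaluation squashed with `tanh`, the names `evaluate` and `td_lambda_update`, and the values of `alpha` and `lam` are all assumptions chosen for illustration.

```python
import math


def evaluate(features, weights):
    """Hypothetical evaluation: a linear score squashed into (-1, 1)."""
    return math.tanh(sum(f * w for f, w in zip(features, weights)))


def td_lambda_update(positions, outcome, weights, alpha=0.1, lam=0.7):
    """Return new weights after one TD(lambda) pass over a finished game.

    positions: feature vectors for the positions seen during the game
    outcome:   final result from the learner's side (+1 win, 0 draw, -1 loss)
    alpha:     learning rate (assumed value)
    lam:       discount applied to later temporal differences (assumed value)
    """
    values = [evaluate(x, weights) for x in positions]
    # The target for each position is the next position's value;
    # the last position is compared with the actual game outcome.
    targets = values[1:] + [outcome]
    diffs = [targets[t] - values[t] for t in range(len(values))]

    new_weights = list(weights)
    for t, x in enumerate(positions):
        # Lambda-weighted sum of this and all later temporal differences.
        credit = sum(lam ** (j - t) * diffs[j] for j in range(t, len(diffs)))
        # Derivative of tanh(s) with respect to s is 1 - tanh(s)^2.
        slope = 1.0 - values[t] ** 2
        for i, f in enumerate(x):
            new_weights[i] += alpha * slope * f * credit
    return new_weights
```

Calling `td_lambda_update(positions, outcome, weights)` once per game moves each weight in the direction that would have made earlier evaluations agree better with later ones, and ultimately with the result of the game.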
Journal title:
- ICGA Journal
Volume: 21
Publication date: 1998